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Abstract 

A convex body K in R" has associated with it a unique circumscribed ellipsoid 
CE(K) with minimum volume, and a unique inscribed ellipsoid IE(K) with maximum 
volume. We first give a unified, modern exposition of the basic theory of these extremal 
ellipsoids using the semi-infinite programming approach pioneered by Fritz John in his 
seminal 1948 paper. We then investigate the automorphism groups of convex bodies 
and their extremal ellipsoids. We show that if the automorphism group of a convex 
body K is large enough, then it is possible to determine the extremal ellipsoids CE(if) 
and IE(if) exactly, using either semi-infinite programming or nonlinear programming. 
As examples, we compute the extremal ellipsoids when the convex body K is the part 
of a given ellipsoid between two parallel hyperplanes, and when K is a truncated second 
order cone or an ellipsoidal cylinder. 
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1 Introduction 



A convex body in R n is a compact convex set with nonempty interior. Let K be a convex 
body in M. n . Among the ellipsoids circumscribing A, there exists a unique one with mini- 
mum volume and similarly, among the ellipsoids inscribed in K, there exists a unique one 
of maximum volume. These are called the minimal circumscribed ellipsoid and maximal 
inscribed ellipsoid of A, and we denote them by CE(iC) and IE(K), respectively. To our 
knowledge, Behrend [2j is the first person to investigate these problems, and proves the 
existence and uniqueness of the two ellipsoids in the plane, that is when n — 2. In the gen- 
eral case, the existence of either ellipsoid is easy to prove using compactness. The ellipsoid 
CE(AT) is often referred to as Lowner ellipsoid since Lowner has used its uniqueness in his 
lectures, see [10]. The uniqueness of CE(A) also follows from the famous paper of John [18] 
although he does not state it explicitly. Subsequently, Danzer, Laugwitz, and Lenz [11] . and 
Zaguskin [39] prove the uniqueness of both ellipsoids. The inscribed ellipsoid problem IE(A) 
is often called John ellipsoid, and sometimes Lowner-John ellipsoid, especially in Banach 
space geometry literature. This designation may seem inappropriate, since John does not 
consider the problem IE(if) in 18J. However, in Banach space geometry literature, one is 
mainly interested in symmetric convex bodies (A = —K), and for this class of convex bodies 
the inscribed and circumscribed ellipsoids are related by polarity, so that results about one 
ellipsoid may be translated into a similar statement about the other. 

The two extremal problems CE(A) and IE (A) are important in several fields includ- 
ing optimization, Banach space geometry, and statistics. They also have applications in 
differential geometry [24] . Lie group theory [11] , and symplectic geometry, among others. 
The ellipsoid algorithm of Khachian [17 for linear programming sparked general interest 
of optimizers in the circumscribed ellipsoid problem. At the fcth step of this algorithm, 
one has an ellipsoid and is interested in finding = CE(A") where A is the 

intersection of with a half plane whose bounding hyperplane passes through the cen- 
ter of E {k \ The ellipsoid E<- k+ ^ can be computed explicitly, and the ratio of the volumes 
vol(E^ k+1 ')/ vol(i?( fc )) determines the rate of convergence of the algorithm and gives its 
polymomial-time complexity. From this perspective, ellipsoids covering the intersection of 
an ellipsoid with two halfspaces whose bounding hyperplanes are parallel have been studied 
as well, since the resulting ellipsoid algorithms are likely to have faster convergence. The 
papers of Konig and Pallaschke [23] and Todd [36] compute the circumscribing ellipsoid 
explicitly. It has been discovered that the circumscribed ellipsoid problem is dually related 
to the optimal design problem in statistics. Consequently, there has been wide interest in 
the problem CE(A) and related problems in this community as well. Inscribed ellipsoid 
problems also arise in optimization. For example, the inscribed ellipsoid method of Tarasov, 
Khachian, and Erlikh [35] is a polynomial-time algorithm for solving general convex pro- 
gramming problems. In this method, one needs to compute numerically an approximation 
to the inscribed ellipsoid of a polytope which is described by a set of linear inequalities. 

To meet the demand from different fields such as optimization, computer science, engi- 
neering, and statistics, there have been many algorithms proposed to numerically compute 
the two extremal ellipsoids CE(A) and IE(A). Recently, there has been a surge of research 
activity in this subject. We do not discuss algorithms in this paper, but the interested reader 
can find more information on this topic and the relevant references in the papers [41] , [34] , 
and [55] , 

This paper has three, related goals. In S[2] and $3] we give semi-infinite programming 
formulations of the problems CE(A) and IE(K), respectively, and then describe their fun- 
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damental properties using the resulting optimality conditions. There are essentially no new 
results in these sections. We include them in order to fill a gap in the optimization liter- 
ature, and we also use some of the results in these sections later on in the paper. There 
exists a sizable literature in Banach space theory dealing with the existence, uniqueness, 
and basic properties of the extremal ellipsoids CE(K') and IE(K). They use (necessarily) 
optimality considerations to arrive at the results they are interested in, but this is done in 
an informal manner. In fact, many of the papers in this literature use the reformulation 
of the ellipsoid problems given in the interesting paper of Lewis [26]; see also [29], [37] . 
There have been some exceptions recently. See for example the interesting papers Gordon 
et al. [13] and Klartag [22 . To our knowledge, the only papers which systematically use 
optimization techniques to prove results about the extremal ellipsoids are the original paper 
of John [18] and the papers of Juhnke [19, 20, 21j. What we offer in § [2] and $3] is a careful, 
unified, and modern synthesis of the basic results on the ellipsoids GE(K) and IE(K) in 
the mainstream optimization literature. We hope that it serves as a useful introduction to 
the subject in the optimization community. 

We devote £|4] to the symmetry properties of convex bodies and the related symmetry 
properties of the corresponding ellipsoids GFi(K) and IE(K). One way to formalize the 
symmetry properties of a convex body K is to consider its ( affine ) automorphism group 
Aut(K). It will be seen that the uniqueness of the two ellipsoids imply that the ellipsoids 
inherit the symmetry properties of the underlying convex body K. That is, Aut(K) is 
contained in the automorphism group of the two extremal ellipsoids. One consequence of 
this is that if K is "symmetric" enough, then either it is possible to anaytically compute the 
extremal ellipsoids exactly, or else it is possible to reduce the complexity of their numerical 
computation. In this section, we demonstrate the former possibility for a class of convex 
bodies K whose automorphism groups act transitively on ext(K ), the extreme points of K . 
Davies [12] shows that for this class of convex bodies, the center of gravity, the center of 
CE(K), and the center of IE(K) all coincide, and this center can be obtained explicitly as 
a Haar integral over the automorphism group Aut(K). We show that the matrix X in the 
circumscribed ellipsoid CE(i^) can also be obtained in a similar manner. We remark that 
there has been a continuous interest on symmetric convex bodies since antiquity, and the 
class of regular polytopes [9] , including the Platonic solids, is a subclass of symmetric convex 
bodies in the sense of Davies. Because of space considerations, we are not able to pursue 
the study of the automorphism groups of convex bodies in greater depth in this paper. We 
plan to explore this subject in future papers. 

In the rest of the paper, starting with ^5] we exploit automorphism groups to analytically 
compute the extremal ellipsoids of two classes of convex bodies. The first class consists of 
convex bodies which are the intersections of a given ellipsoid with two halfspaces whose 
bounding hyperplanes are parallel, and have been mentioned above. We call such convex 
bodies slabs. The second class consists of convex bodies obtained by taking the convex 
hull of the intersection of the same parallel hyperplanes with the ellipsoid. We note that a 
convex body in this class is either a truncated second order cone or an ellipsoidal cylinder, 
depending on the location of the bounding hyperplanes with respect to the center of the 
ellipsoid. 

In <j5j we compute the autormorphism group of a slab K and use it to determine the 
form of the center and matrix of its ellipsoid CE(K). Although the automorphism group 
Aut(K) is not large enough to compute the ellipsoid CE(if) exactly, it is large enough to 
reduce its determination to computing just three parameters (instead of n(n + 3)/2 in the 
general case), one to determine its center and two to determine its matrix. 
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In <|6j we formulate the CE(K) problem for a slab as a semi-infinite programming 
problem, and obtain its solution by computing the three parameters of CE(if ) directly 
from the Fritz John optimality conditions for the semi-infinite program. As we mentioned 
already, Konig and Pallaschke [53j and Todd [35] solve this exact problem. Konig and 
Pallaschke's approach is similar to ours: they use the uniqueness and invariance properties 
of the ellipsoid CE(if). However, their solution is not complete since they only consider 
the cases when the slab does not contain the center of the given ellipsoid. Todd gives a 
complete proof covering all cases. His proof is based on guessing the optimal ellipsoid and 
then proving its minimality by using some bounds on the volume of a covering ellipsoid. In 

we also formulate the ellipsoid problem CE(if) as a nonlinear programming problem, 
and give a second, independent solution for it. 

For interesting applications of the ellipsoid CE(K) of a slab, see the papers [27] and [3j. 

In 33 we formulate the IE(K ) problem for a slab as a semi-infinite programming prob- 
lem, and obtain its solution directly from the resulting Fritz John optimality conditions. We 
also formulate the same problem as a nonlinear programming problem, but do not provide 
its solution in order to keep the the length of the paper within reasonable bounds. 

Finally, in $8] we formulate the CE(K) problem for a convex body from the second class 
of convex bodies mentioned above as a semi-infinite programming problem, and obtain its 
solution directly from Fritz John optimality conditions. The form of the optimal ellipsoid 
CFi(K) turns out to be very similar to the form of the corresponding ellipsoid for a slab. 
We do not solve the inscribed ellipsoid problem for the second class of convex bodies for 
space considerations. 

We remark that the ideas and techniques used in this paper for determining the ex- 
tremal ellipsoids for specific classes of convex bodies can be generalized to other classes of 
convex bodies as long as these bodies have large enough automorphism groups. It is rea- 
sonable to expect that automorphism groups can also be used advantageously in numerical 
determination of extremal ellipsoids. 

Our notation is fairly standard. We denote the set of symmetric n x n matrices by 
§R nxrl . In R™, we use the bracket notation for inner products, thus (u,v) — u T v. In the 
vector space R™ xn ofnxn matrices (and hence in §R" xn ), we use the trace inner product 

(X,Y) =tr(XY T ). 

If both inner products are used within the same equation, then the meaning of each inner 
product should be clear from the context. We define and use additional inner products in 
this paper, especially in $4] The sets dX and conv(X), and ext(X) denote the boundary 
and the convex hull of a set X in R™, respectively, and ext(K) is the set of extreme points 
of a convex set K in R™. 

2 The minimum volume circumscribed ellipsoid prob- 
lem 

We recall that the circumscribed ellipsoid problem is the problem of finding a minimum 
volume ellipsoid circumscribing a convex body K in R™. This is the main problem treated 
in Fritz John |18| . In this paper, John shows that such an ellipsoid exists and is unique; we 
denote it by CF,(K). John introduces semi-infinite programming and develops his optimality 



conditions to prove the following deep result about the ellipsoid CE(K): the ellipsoid with 
the same center as GE(K) but shrunk by a factor n is contained in K 1 and if K is symmetric 
(K = —K), then CE(K) needs to be shrunk by a smaller factor yfn to be contained in K. 
This fact is very important in the geometric theory of Banach spaces. In that theory, a 
symmetric convex body K is the unit ball of a Banach space, and if K is an ellipsoid, then 
the Banach space is a Hilbert space. Consequently, the shrinkage factor indicates how close 
the Banach space is to being a Hilbert space. In this context, it is not important to compute 
the exact ellipsoid GE(K). However, in some convex programming algorithms, including the 
ellipsoid method and its variants, the exact or nearly exact ellipsoid CE(K) needs to be 
computed. If K is sufficiently simple, CE(K ) can be computed analytically. In more general 
cases, numerical algorithms have been developed to approximately compute CE(K). 

In this section, we deal with the CE(if ) problem more or less following John's approach. 
However, in the interest of brevity and clarity, we use more modern notation and give new 
and simpler proofs for some of the technical results. 

An ellipsoid E in R™ is an affine image of the unit ball B n := {u £ W 1 : \ \u\ \ < 1}, that 

is, 

E = c + A(B n ) = {c + Au : u £ E", |M| = 1} C E m , (2.1) 

where A S E mx ™ is any m x n matrix. Here c is the center of E and the volume of E is 
given by vol(-E') = det(A) vol(B n ). We are interested in the case where E is a solid body 
(E has a non-empty interior), hence we assume that A is a non-singular n x n matrix. 
Let A have the singular value decomposition A = V1SV2 where V\, V2 are orthogonal 
n x n matrices and £ is a diagonal matrix with positive elements. Then we have the polar 
decomposition of A, that is, A = SO where S = V\AVi £ §R nx " is positive definite and 
O = V1V2 is an orthogonal matrix. Consequently, E = c + SO(B n ) = c + S(B n ), that is, 
the matrix A in the definition of the ellipsoid E in (|2.1|) can be taken to be symmetric and 
positive definite, an assumption we make from here on. By making the change of variables 
x := c + Au, that is, u = A~ x (x — c), and defining X := A~ 2 , the ellipsoid E in (12. ip . 
E = {x £ E" : 1 1 A - 1 (rt: — c)|| 2 < 1} can also be written in the form 

E = E(X, c) := {x £ E™ : (X(x -c),x~c) < 1}. (2.2) 

Note that we have 

vol(E) = det{X)- 1/2 u n , (2.3) 

where ui n = vol(B n ). 

Consequently, we can set up the circumscribed ellipsoid problem as a semi-infinite pro- 
gram 

min — log det X 

(2.4) 

s.t. (X(y-c),y-c)<l, Vy£K, y 1 

in which the decision variables are X £ §M n>< ™ and c £ R™. 

There exists an ellipsoid of minimum volume circumscribing the convex body K. It 
suffices to prove that the set of feasible (X, c) in problem (|2.4|) is compact. Let K contain 
a ball of radius r > 0, and let E — E(X,c) be an ellipsoid covering K. Note that the ball 
still lies in E if we shift its center to center c of the ellipsoid E. Thus, every vector u in E™, 
||u|| = 1 must satisfy the inequality (Xu, u) < 1/r 2 , that is, the eigenvalues of X are at 
most 1/r 2 . It follows from the spectral decomposition of symmetric matrices that the set of 
feasible X is compact. If the norm of the center c of the ellipsoid E circumscribing K goes 
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to infinity, then the volume of the ellipsoid goes to infinity as well. This proves that the set 
of feasible (X, c) is compact. 

The following basic theorem of Fritz John jTH] is one of our main tools in this paper. The 
book [16] develops optimality conditions for semi-infinite programming including this result 
and treats several problems from analysis and geometry using semi-infinite programming 
techniques. 

Theorem 2.1. (Fritz John) Consider the optimization problem 

min f(x) 

(2 5) 

s.t. g(x,y)<0, VyEY, 

where fix) is a continuously differentiable function defined on an open set X C R™ ; and 
gix,y) and V x gix,y) are continuous functions defined on X x Y where Y is a compact 
set in some topological space. If x is a local minimizer of (|2.5[) . then there exist at most 
n active constraints (g(x,yi) — 0) and a non-trivial, non-negative multiplier 

vector ^ (Ao, Ai, . . . , A&) > such that 

fc 

X Vf(x) + h^ x g{x, y % ) = 0. 
i=i 

We now derive the optimality conditions for (|2.4[) . 

Theorem 2.2. Let K be a convex body in W 1 . There exists an ellipsoid of minimum volume 
circumscribing K. If E(X,c) is such an ellipsoid, then there exists a multiplier vector A = 
(Ai, . . . , Afc) > 0, < k < n(n + 3)/2, and points {ui]\ in K such that 

k 

X^ 1 = ^2 ^i( u i - C )i U i - C ) T > 
i=l 

= 5>( Ui -c), (2-6) 

i=i 

u,e3ifn9£(I,c), i = l,...,k, 
K C E(X, c). 

We call the points {ui]\ in dK n dE contact points of K and E. 

Proof. The existence of a minimum volume circumscribed ellipsoid is already proved above. 
Let E(X, c) be such an ellipsoid. The constraints in (|2.4p are indexed by y S K, a compact 
set, and Theorem l2 . 1 1 implies that there exists a non-zero multiplier vector (Ao, Ai, . . . , Xk) > 
0, where k < n(n + l)/2 + n = n(n + 3)/2, Ai > for i > 0, and points {ui}\ in K such 
that the Lagrangian function 

fc 

c, A) := — Ao logdet X + Xi(X(v,i — c), Ui — c) 

i=l 

k 

= -A logdetX + (X,y^Ai(iij - c)(ui - c) T ), 
j=i 
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where the inner product on the last line is the trace inner product on S n , satisfies the 
optimality conditions 

k 

Q = V c L{X,c,\)=X^Xi{ui-c), 
i=i 

k 

= W X L(X, c, A) = -AoX- 1 + £ K{ Ul - c)(Ui - c) T , 

i=l 

where we used the well-known fact that V^logdetX = X^ 1 . If Ao > 0, then = 
tr(^j = i Aj(uj — c)(ui — c) T ) = Ai||wi — c|| 2 . This implies that Ai = for all i, con- 

tradicting A ^ 0. We let Ao = 1 without loss of generality, and arrive at the Fritz John 
conditions (|2.6|) . □ 

Remark 2.3. The contact points have applications in several fields, in optimal designs, 
and in estimating the size of almost orthogonal submatrices of orthogonal matrices [33j . 
for example. Gruber [14] shows that "most" convex bodies K have the maximum number 
n(n + 3)/2 of contact points. See also [33] for a simpler proof. Similar results also hold for 
the maximum volume inscribed ellipsoids discussed in $3j see [14] . However, Rudelson [33] 
shows that for every e > and every convex body K, there exists a nearby convex body 
L whose distance (Banach-Mazur distance) to K is less than 1 + e and which at most 
k < C(e) ■ rtlog 3 n contact points. This has obvious implications for numerical algorithms 
that try to compute approximate covering ellipsoids. 

Remark 2.4. Let S = ext(K), the set of extreme points of K . We have K = conv(S'), the 
convex hull of S, by a Theorem of Minkowski, see Rockafellar [32], Corollary 18.5.1. Note 
that S and K have the same extremal covering ellipsoid, and applying Theorem 12.21 to S 
instead of K shows that we can choose u\ € S = cxt(K), i = 1, . . . , k. 

An independent proof of the above fact runs as follows: let x G dK n dE be a contact 
point. Noting dE = ext(E), we have x £ ext(E). If x ^ ext(K), then there exist y,z £ K, 
y =/= z, such that x lies in the interior of the line segment [y,z\. However, x £ [y,z] C E, 
contradicting the fact that x £ ext(E). 

The equation Y^t=i^i( u i ~ c ) = in ()2.6|1 gives c £ conv({tij}f). This immediately 
implies 

Corollary 2.5. Let K be a convex body in R". The contact points of CE(K) are not 
contained in any closed halfspace whose bounding hyperplane passes through the center of 
CE(K). 

For most theoretical purposes, we may assume that the optimal ellipsoid is the unit ball 
E(I n , 0). This can be accomplished by an affine change of the coordinates, if necessary. This 
results in the more transparent optimality conditions 

k k 

i=i i=i y 1 - 1 ) 

u,£dK^8B n , i=l,...,k, K C B n . 
Taking traces of both sides in the first equation above gives n = tr(7„) = tr(J^._ . X^UiuJ) = 
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J2^=n. (2.8) 



In this section and in convex duality will play an important role. If C is a convex 
body in R n , then the Minkowski support function of C is defined by 

sc(d) := max(d, it). 

It is obviously defined on R™ and is a convex function since it is a maximum of linear 
functions indexed by u. In fact, sc — S c , where 5c is the indicator function of C and * 
denotes the Fenchel dual. If C and D are two convex bodies, it follows from Corollary 13.1.1 
in [32] that C C D if and only if sc < sd- 

We compute 

se(x.c) (d) = max{(rf, u) : (X(u — c),u — c) < 1} 
= max{(d,c + X" 1/2 t;) : ||u|| < 1)} 

(2 9) 

= (c, d) + max (X~ 1/2 d, v) = (c, d) + | \X~ 1/2 d\ \ 
Nl=i 

= ( c ,d) + (X-^ct) 1 ' 2 , 
where we have defined v — X x l 2 (u — c) or u = c + X~ x l 2 v. 
The polar of the set C is defined by 

C° := {d: a c (d) < 1} = {x : (x,u) < l,Vu6(J}. 
An easy calculation shows that 

(conv({ui}^))° = {a; : (x,Ui) < 1, i = 1, ...,k}. 

The following is a key result. Among other things, it implies that the optimality con- 
ditions (|2.7p are powerful enough to prove the uniqueness of the minimum volume circum- 
scribed ellipsoid as well as the uniqueness of the maximal volume inscribed ellipsoid treated 
in g3 

Lemma 2.6. Let {ui}\ (k arbitrary) be a set of unit vectors in K" satisfying the conditions 

2j=i ^i u i u f — In and J2i=i ^i u i = 0- Define the polytope P — conv and its polar 

P° = {x : {ui,x) < 1, i = 1, ...,&}. The unit ball is both the unique minimum volume 
circumscribed ellipsoid of P and the unique maximum volume inscribed ellipsoid of P° . 

Proof. Let E(X, c) be any ellipsoid covering the points ■ We have (X(ui — c), m — c) < 
1 and 

n = 2J Aj > 2J A 1 (X(m 1 - c),i*i - c) = (X, 2J - c)(ttj - c) T ) 

i=l i=l z=l 

fa fa 



(X, ^ Aittiuf) - (X, cAittf) - (X, £ A^c T ) + (X, Aj) 



cc T ) 



= (X, I n ) + n(X, cc T ) = tr(A) + n(Xc, c) 
> n (det(X) 1 /™ + (Ac, c)" 
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Here the fourth equality follows from (|2.7|) , and the last inequality follows from the fact that 
det(X) 1 /™ < tr(X)/n which is precisely the arithmetic-geometric mean inequality applied 
to the eigenvalues of X. Thus 

detpf) 1 /™ + (Xc,c) < 1, 

and the equality det(X) = 1 holds if and only if c = 0, (X(ui — c),Ui — c) = 1 for all 
i = 1, . . . , fc, and the arithmetic-geometric mean inequality holds as an equality. The last 
condition holds if and only if X is a positive multiple of the identity matrix (and then 
det(X) = 1 implies X = I n ). Thus, the minimum volume ellipsoid covering the points 
{ui}\ must be the unit ball. 

Next, let E(X, c) be any ellipsoid inscribed in P° . It follows from ([2~3f that vol(E(X, c)) = 
det(X _1 )w n . By virtue of ([2~1?]) . the inclusion E(X,c) C P° implies 

SE(x,c){Ui) = (c,Ui) + \\X~ 1 / 2 u l \\ < spo(ui) = max(u t ,Uj) < 1, i = 1, . . . , fc. 

3 

The Cauchy-Schwarz inequality gives {X~ 1 / 2 Ui,u i ) < WX^^-^mW ■ \\ui\\ = ||X _1 / 2 Wj||. 
Therefore, 

k k 

n = ^T / \ t >^2\ l (<c, m) + (x-^ 2 Ul , Ul )^ 

i=l i=l 

= (X- 1 / 2 ,J2^iuJ)=tx(X- 1 / 2 ) >ndet(X)- l ' 2n , 

i=l 

where the last inequality follows from the arithmetic-geometric mean inequality applied to 
the eigenvalues of X -1 / 2 . Thus det(X) > 1, and the equality det(X) = 1 holds if and only 
if (i) X is a positive multiple of the identity matrix (and then det(X) = 1 implies X = 7„), 
and (ii) 1 = (c, Ui) + {X^ 1 / 2 ^, Ui) = (c, Ui) + 1, that is, (c, Ui) = for all i = 1, . . . , fc. Then 

the equation X^T=i ^i u i u f = In implies that ||c|| 2 = 53j=i ^i{ c i u i)' 2 — 0- Thus, condition 
(ii) holds if and only if c — 0. The lemma is proved. □ 

Theorem 2.7. Let K be a convex body in R™. The minimum volume circumscribed ellip- 
soid of K is unique. Moreover, the optimality conditions (12. 6| are necessary and sufficient 
conditions for an ellipsoid E(X,c) to be the minimum volume circumscribed ellipsoid of K . 

It is easy to see that K can be replaced by S = ext(if) in this theorem. 

Proof. The neccessity of the conditions (12 .6[) is already proved in Theorem l2.2l We assume, 
without any loss of generality, that E(I n , 0) = B n satisfies the optimality conditions (|2.7|) for 
some set of multipliers {A^}. Let E = CE(K). We claim that E = B n . This will immediately 
imply the remaining parts of the theorem. Since E C P, we have vol(S n ) > vo\(E) > 
vol(B n ) where the first inequality follows because E has minimum volume among ellipsoids 
circumscribing K and the second inequality follows from Lemma 12.61 The same lemma 
proves the claim that E = B n . □ 

The breadth of a convex body is the smallest distance between its two parallel support 
planes, and diameter is the distance between its two farthest points. The following result of 
John [TS] shows that a convex body can be "rounded" by an affine transformation. Its proof 
also gives valuable information about the locations of the contact points. Its easy proof is 
due to Juhnke [21]. 
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Corollary 2.8. Let K be a convex body whose optimal covering ellipsoid is the unit ball, 
and let {ui}i be its contact points. Then 

max(d, x) max(— d, x) > max(<i, Ui) max(— d, ud > — , V<i, \\d\\ = 1, 

x£K x£K i i n 

2 

max(d, x) + max(— d, x) > max(d, ui) + max(— d, Uj) > — =, Vd, lldll = 1. 
K K i i y/n 

Consequently, any convex body can be transformed by an affine map into a body, for which 
the ratio of breadth to diameter is at least 1 / y/n. 

Proof. Define P = conv({uj}J). Since sp(d) = m&Xk(d,Uk) > (d,Ui) for any i, we have 
fc 

< E x i( s p( d ) ~ ( d > Ui)){s P {-d) + (d, Ui )) 

i=l 
k 

= ^i) s p{d)sp(~d) - ^2 ^i(d, u i) 2 = nsp(d)s P (-d) - 1 

i—l i 

where the equalities follow from l|2.7p . This proves the first line of relations in the corollary. 
The second line of relations follow from the first one using the inequality (sp(d)+sp(—d)) 2 > 
4sp(d)sp(—d). Observe that the quantity SK(d) + SK(—d) = m.ax. xe K(d,x)+m&x X £K(—d,x) 
is the distance between the two parallel support planes of K in direction d. This proves the 
corollary for K whose optimal ellipsoid CE(iT) = B n . 

The rest of the corollary follows since an arbitrary convex body can be transformed by 
an affine transformation into another convex body whose optimal covering ellipsoid is the 
unit ball. □ 

We now give a proof of Fritz John's celebrated result mentioned at the beginning of this 
section. Our proof is simpler, and uses ideas from Ball p] and Juhnke [19] . 

Theorem 2.9. Let K be a convex body in R™ and E(X,c) = CE(-fT) be its optimal circum- 
scribing ellipsoid. The ellipsoid with the same center c but shrunk by a factor n is contained 
in K. If K is symmetric (K = —K), then the ellipsoid with the same center c but shrunk 
only by a factor \fn is contained in K . 

Proof. Without loss of generality, we assume that CE(K) = E(I n ,0) = B n . The theorem 
states that n~ 1 B n C K. Let 

P = conv ({tt;}i) 

be the convex hull of the contact points. We claim the stronger statement that n~ 1 B n C P. 
Since P C K , we will then have n~ 1 B n C K . By duality, the claim is equivalent to showing 
that the polar sets satisfy P* C (n~ 1 B n )* — nB n . Let x E P*. Since — ||sg|| = — ||x|| • ||tti|| < 
(x,Ui) < 1, we have 

k 

0<5^X i (l-<a; > Ui))(||a:|| + 

8=1 

k k 

= (E^)l^ll " E MM) 2 = "INI - INI 2 , 

i=l i=l 
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where the second equality follows from JZjAj = n and (|2.7|) . This implies ||x|| < n, and 
proves that P* C nS n . 

If X is symmetric, we define Q — conv ({iitj}^) C if and claim that n~ x l 2 B n C Q, or 
equivalently, that Q* C (nT 1 ! 2 B n )* — ^/nB n . It is easily shown that Q* — {x : |(x, < 
1, i = 1, . . . , fc}. Let x £ Q*. Since — 1 < (x, Ui) < 1, we have 

k 

z=l 

This gives ||x|| < \fn and proves the claim. □ 

Remark 2.10. The minimum volume covering ellipsoid problem may be set as a semi- 
infinite program in a different way, by replacing the set inclusion K C E(X, c) by the 
equivalent functional constraints SE(x,c)(d) > Sif(<i), that is by the constraints 

(c,d) + (Yd,d)^ 2 > S K(d), Vd, \\d\\ = l, 

where we restrict d to the unit sphere since support functions are homogeneous (of degree 
1), in order to get a compact indexing set. The resulting semi-infinite program is solved in 
the same way as (|2.4|) , and mirrors the solution to the maximum volume inscribed ellipsoid 
problem given in Sj3l 



3 The maximum volume inscribed ellipsoid problem 

Recall that the inscribed ellipsoid problem is the problem of finding a maximum volume 
ellipsoid inscribed in a convex body K in M. n . It will be seen that this ellipsoid is unique 
as well, and we denote it by IE(K). As we mentioned in the Introduction, this ellipsoid is 
often referred to as the John ellipsoid or Lowner-John ellipsoid. 

In this section, we again use semi-infinite programming to treat this problem. The 
inscribed ellipsoid has properties similar to those of the circumscribed ellipsoid. For example, 
the ellipsoid with the same center but blown up n times contains K, and in the case K 
is symmetric (K = —K), the ellipsoid needs to be blown up by a smaller factor y/n. The 
ellipsoid IE(K ) is very useful in the geometric theory of Banach spaces. It is also useful in 
some convex programming algorithms, such as the inscribed ellipsoid method of Tarasov, 
Erlikh, and Khachiyan [55] , 

As a first step, using (|2.3[) . we can formulate the inscribed ellipsoid problem as a semi- 
infinite program 

min{detX : E(X,c) C K}. 

However, this is hard to work with, due to the inconvenient form of the constraints, 
E(X, c) C K. We replace this inclusion by the functional constraints 

SB(x,c)(d) < s K (d), Vd e B n , 

where we again restrict d to the unit sphere since support functions are homogeneous (of 
degree 1), in order to get a compact indexing set. 

Defining Y = X^ 1 , we can therefore rewrite our semi-infinite program in the form 
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min — log det Y 

s.t. (c,d) + (Yd.d) 1 / 2 < s K (d), Vd:||d|| = l, 



(3.1) 



in which the decision variables are (Y, c) € S n x R™ and we have infinitely many constraints 
indexed by the unit vector ||d|| = 1. 

Since sk is a convex function on W 1 , it is continuous. Therefore, there exists a positive 
constant M > such that if (Y, c) is a feasible decision variable, then | (c, d) \ < M, and 
(Yd, d) < M for all ||<i|| = 1. This proves that the set of feasible (Y, c) for problem (|3.ip is 
compact, and implies that there exists a maximum volume ellipsoid inscribed in K. 

We derive the optimality conditions for the maximum volume inscribed ellipsoid. 

Theorem 3.1. Let K be a convex body inl". There exists an ellipsoid of maximum volume 
inscribed in K . If E(X, c) is such an ellipsoid, then there exists a multiplier vector A = 
(Ax, . . • , Afc) > 0, < k < n(n + 3)/2, and contact points such that 

k 

X^ 1 = ^2 X i( u * ~ C )( U i ~ c ) T i 
i=l 

o = ]T>K-c), (3-2) 

i=l 

UiedKndE(X,c), i = l,...,k, 
E{X, c) C K. 



Proof. The existence of a maximum volume ellipsoid inscribed in K is already proved above. 
Let E(X, c) be such an ellipsoid. Define Y = X~ l . Since constraints in (|3.ip are indexed by 
\\d\ \ = 1, Theorem 12 . 1 1 applies : there exists a non-zero multiplier vector (So, Si, ■ ■ ■ , Sk) > 0, 
where k < n(n + 3)/2, Si > for i > 0, and directions {di]\, \\di\\ — 1, satisfying the 
conditions 

( C ,d l ) + (Yd l ,d,) 1 ' 2 =S K (di), 

such that the Lagrangian function 

fc 

L{Y, c,S):= -S log det Y + 2 ^ S l [(c, di) + (Yd,, di) 1/2 - s K (di) 

i=l 

satisfies the optimality conditions 

fc 

= V c L(Y,c,S) = Y,Sidi, 

o = v,L ( y,c, 5) = -^- 1 + |:^^d i dr 

Recalling that ||dj|| = 1 and taking the trace of the right hand side of the last equation 
above gives S^tniY -1 ) = X^=i Si(Ydi,di)^ 1 ^ 2 . If <5o = 0, then all Si — 0, which contradicts 
5 0. Therefore, <5o 7^ 0, and we let Sq = 1. Define 

Ui :-c+(rd l ,d l )- 1 / 2 rd l , A, := (Yd, t , d^H,, i = l,...,k. 

We have (d,, Ui) = (c, di) + (Ydi, di) 1 ! 2 , so that 
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SE(x,c){di) = s K (di) = (di,Ui), 

which means that Ui £ dK n dE(X,c), that is Ui is a contact point. Rewriting the above 
optimality conditions in terms of {ui} and {A^} and simplifying, we arrive at the conditions 
(1X21. □ 



As in the circumscribed ellipsoid case, we have 

Corollary 3.2. Let K be a convex body in R". The contact points ofIE(K) are not con- 
tained in any closed half space whose bounding hyperplane passes through the center ofIE(K). 

We can simplify these conditions by assuming that the optimal ellipsoid is the unit ball 
E(I n , 0). Then the Fritz John conditions become 

k k 

i n = yjAjUjuf , o = \jUj, 

i=l i=l 

Ui£dKr\8B n , i = l,...,k, B n CK. 



(3.3) 



We note that the optimality conditions (|3.3p are exactly the same as the corresponding 
optimality conditions (|2.7p in the circumscribed ellipsoid case, except for the feasibility 
constraints B n C K. 

Theorem 3.3. Let K be a convex body in W l . The maximal volume ellipsoid inscribed in 
K is unique. Furthermore, the optimality conditions (|3.2[) are necessary and sufficient for 
an ellipsoid E(X,c) to be the maximal volume inscribed ellipsoid of K . 



The proof uses Lemma 12.61 It is omitted since it is very similar to the proof of Theo- 
rem 123 

We end this section by proving an analogue of Fritz John's containment results concern- 
ing CE(Jf). 

Theorem 3.4. Let K be a convex body in W 1 and let E(X, c) = IE(i^) be its optimal 
inscribed ellipsoid. The ellipsoid with the same center c but enlarged by a factor n contains 
K. If K is symmetric, then the ellipsoid with the same center c but enlarged by a factor s/n 
contains K . 



Proof. The proof here is similar to the proof of Theorem l2.9l Without loss of generality, we 
assume that IE(K) = E(I,0) = B n . The first part of the theorem follows if we can prove 
the claim that 

K CP* C nB n . 

Since 1 = sic(ui) = max xg x(wi, x), the first inclusion holds true. If x £ P*, then — ||x|| = 
' \\ u i\\ ^ ( x ,Ui) < 1, and 



k 

< - (x, Ul ))(\M\ + (x,ih)) = n\\x\\ - \\x\\ 2 , 

i=l 



where the equality follows from A,; = n and (|3.3p . This implies ||x|| < n, and proves the 
second inclusion in the claim. 

If K is symmetric, we define Q = conv ({iu,}^) C K and claim that K <Z Q* £ ^RB n . 
Since 1 = sk we have x) \ < 1, and the first inclusion follows. To prove the second 
inclusion in the claim, let x £ Q* . We have \ (x,m)\ < 1, and the rest of the proof follows 
as in the proof of Theorem 12.91 □ 
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4 Automorphism group of convex bodies 



Let K be a convex body in R". The uniqueness of the two extremal ellipsoids CE(A) and 
IE(A) have important consequences regarding the invariance properties of the two ellipsoids. 
We will see in this section that the symmetry properties of the convex body K is inherited 
by the two ellipsoids. If K is symmetric enough, then it becomes possible to give explicit 
formulae for the ellipsoids CE(A) and IE(if). We will demonstrate this for some special 
convex bodies in the remaining Sections of this paper. 

Apart from its intrinsic importance, the invariance properties of the ellipsoids CE(A) 
and IE(K) have important applications to Lie groups [TT], [25], to differential geometry [24] . 
and to the computation of the extremal ellipsoids for some special polytopes and convex 
bodies [6], [4], among others. 

We start with a 

Definition 4.1. The (afhne) automorphism group Aut(A) of a convex body K in W 1 is 
the set of afhne transformation T(x) = a + Ax leaving K invariant, that is, 

Aut(A) = {TO) = a + Ax : T(K) = K}. 

Note that since < vol(K) = vol(T(A)) = | det A\ vol(A), we have | det A\ = 1. 

It is shown in [14] that the automorphism group of most convex bodies consists of the 
identity transformation alone. This is to be expected, since the symmetry properties of a 
given convex body can easily be destroyed by slightly perturbing the body. Nevertheless, 
the study of the symmetry properties of convex bodies is important for many reasons. 

The ellipsoids are the most symmetric convex bodies. Therefore, we first investigate 
their automorphism groups and then relate them to the automorphism groups of arbitrary 
convex bodies. 

Definition 4.2. Let A be an invertible matrix in SIR™*™. Equip M™ with the quadratic 
form (Au, v) which we write as an inner product 

(u,v)a ■= (Au,v). 

We denote by 0(R n , A) the set of linear maps orthogonal under this inner product, 

0(W\A) :={geR nxn :g*g = gg*=I} 

= {g e K nxn : {gu,gv) A = (u,v) A , V«,w G K"} 
= {g € R" x " : g T Ag = A}, 

where g* is the conjugate matrix of g with respect to the inner product (-,-)a, that is 
(gu,v)A — (u,g*v)A for all u,v in R™. The second equality above follows as (gu,gv)A = 
(u, g*gv)A and this equals (u,v)a if and only if g*g — I. The third equality follows since 
(g T Agu,v) — (Agu,gv) — (gu,gv)A = (u,v)a = {Au, v). If A is positive definite, then 
(M n , (-,-)x) is a Euclidean space. In particular, O n := 0(M. n ,I) is the set of orthogonal 
matrices in the usual inner product on K™. 

Lemma 4.3. If X is a symmetric, positive definite nx n matrix, then 

Aut(£(A,0)) = 0(R n ,X). 
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In particular, Ant(B n ) is the set of n x n orthogonal matrices. We also have 

Aut{E(X, c)) = T c O(R", A)T_ C , 

where T c is the translation map T c x — c + x. Moreover, Aut(-B(X, c)) fixes c, the center of 
E(X,c), that is, 9(c) — c for every 9 in Aut(E(X,c)). 

Proof. We first determine Aut (£?„). Let T be in Aut(B n ), where T(x) = a + Ax. Since T 
maps the boundary of B n onto itself, we have q(x) := \ \a + Ax\\ 2 = (A T Ax, x) + 2(A T a, x) + 
||a|| 2 = 1 for all ||x|| = 1. Then q(x)-q(-x) = A(A T a,x) = for all ||x|| = 1, which implies 
that A T a — and since A is invertible, a — 0. Consequently, q(x) = (A T Ax,x) — 1 for all 
||x|| = 1, which gives A T A — I n , that is, A is an orthogonal matrix. 

Next, we determine Ant(E(X, 0)). We have the commutative diagram 
B n E(X,0) — ^— E(X,c) 

U 8 9 

B n E(X,0) E(X,c) 

where 9 e Aut(E(X,c)), U G Aut(B n ) = O n , and 9 Q G Aut(E(X, 0)). From the diagram, 
we have I = U T U = {X 1 ' 2 9 a X- 1 / 2 ) T {X 1 / 2 9 X~ 1 / 2 ) = X~ 1 /' 2 9j ) X9 X~ 1 / 2 . This gives 
6$X6 = X and proves that Aut(E(X,Q)) = 0(R", X). 

Since Xjr 1 = T_ c , we have 

Aut(£(X, c)) = T c o Aut(S(X, 0)) o T_ c . 

Every 6» in Aut{E(X,c)) has the form 9{u) = (T c o 9 o T_ c )(u) = T c {9 {-c + u)) = 
(c — 6*oc) +#ou for some in Aut(-Bo)- This gives 0(c) = c, meaning that Aut(£'(X, c)) hxes 
the center of E. □ 

Definition 4.4. Let K be a convex body in R n . An ellipsoid E = E(X, c) is an invariant 
ellipsoid of K if Aut(X) C Aut(-B), that is, if every automorphism of K is an automorphism 
of E. 

It immediately follows from Lemma 14.31 that Aut(-ftT) fixes the center of any invariant 
ellipsoid E of K. 

The following Theorem in Danzer et al. [llj is a central result regarding the symmetry 
properties of the extremal ellipsoids. 

Theorem 4.5. Let K be a convex body in W 1 . The extremal ellipsoids CE(A") and IE(A') 
are invariant ellipsoids of K . Thus, Aut(K) C Aut(CE(AT)), Aut(K) C Aut(IE(AT)), and 
Aut(K) fixes the centers of CE(K) andlE(K). 

Proof. Since the arguments are similar, we only prove the statements about CE(if). Let g G 
AvX(K). Since K C CE(Jf) and K = gK C g(CE(K)), the ellipsoids CE(K) and g(CE(K)) 
both cover K, and since vo\(GE(K)) = vol(g(GE(K))), they are both minimum volume 
circumscribed ellipsoids of K. It follows from Theorem O that g(CE(i^)) = CE(AT). □ 

Corollary 4.6. The automorphism group Ant[K) of a convex body K in R™ is a compact 
Lie group. 
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Proof. We have Aut(K) C Aut(CE(A)) by Theorem E3J and Lemma 0~3] implies that 
Aut(CE(A)) is compact. Clearly, Aut(A) is a closed subset of the general affine linear 
group in R™. It follows that Aut(A) is a compact group. A classical theorem of von Neumann 
implies that it is a Lie group. □ 

Remark 4.7. The existence of invariant (fixed) points and (symmetric positive definite) 
matrices for Aut(A) have been demonstrated above using the invariance properties of the 
either one of the extremal ellipsoids CE(A) and IE(K). The same goal can be achieved in 
at least two other ways, using the invariance properties of either the center of gravity of 
convex sets, or of the Haar probability measure [lq on G = Aut(A). In a certain sense, all 
three procedures are similar in that they all employ averaging, but in different ways. 

By definition, the center of gravity of a convex body A is the point 

f K xdx _ (J K xi dx, ... , J K x n dx) 



cg{K) 



J K dx - vo\(K) 



One may think of cg(K) as the limit of the points p :— 5Zj =1 (vol(Ai)/ vo\(K))xi where 
{Ki]\ is a partition of A into subregions and Xi in Ki. Since p E A, we see that cg(A) lies in 
A. If T E Aut(A), then Tp = £ti( vol (^)/ vo\{K))Tx t = £ti( vol (^)/ vol{K))T Xi . 
We see that both {p} and {Tp} converge to cg(K ), proving that Aut(A) fixes the center of 
gravity of A. 

Moreover, one can prove the existence of an invariant ellipsoid for A, see [28], pp. 130-135: 
it is easy to verify that the map it : Aut(A) — * GL(SR nxn ) given by the formula 

n(g)(S) = (g®g)(S) := gSg T 

is a representation of the group Aut(A") on the matrix space SK"*™, that is, Tr(gh) = 
■K(g)iT{h): 

n(gh)(S) - ghSh T g T = g{*{h){S)g T = 7r(g)n(h)(S). 
Consider the group Q := 7r(Aut(A)) and the orbit 

C s - G(S) = {gSg T : g E Aut(A)} 

where S E EM nxn is an arbitrary positive definite matrix. Define the convex body /C = 
conv(Cs) C SR™ xn . Clearly, the orbit Cs is invariant under the group G, and thus so is the 
convex body JC. It follows from the above argument that the center of gravity Y — cg(/C) 
is fixed by G, that is, gYg T = Y for all g in Aut(A). This gives (ff" 1 ) 7 ^ 1 ^ 1 = S~ l , or 
equivalently, g T S~ 1 g = S^ 1 for all g in Aut(A). This means that X = Y~ x is invariant 
under Aut(A). Then any ellipsoid E(X, c), where c E K is any fixed point of Aut(A), say 
the center of gravity of A, is an invariant ellipsoid of K. 

Let n — hg be the unique Haar probability measure on G = Aut(A). If / : G — * R is a 
continuous function and h,k E G are arbitrary, we have 

f(hg)dfi(g)= [ f(g)dp(g)= [ f(gk)dp(g)= [ fig^dpig), 

G JG JG JG 

where the first and second equalities express left and right invariance properties of the 
Haar integral, respectively. If x is any interior point of A, then the point c :— f G gxd[i(g) 
lies in interior(K) and is fixed by Aut(A), for if h E Aut(A), then he = J G hgcdp(g) = 
J G gcd /i(g) — c, where the second equality follows from the left invariance of /i. Finally, 



15 



the standard proof of the existence of a positive definite invariant matrix X found in most 
textbooks proceeds as follows: start with any inner product (-, •) on R™, and define the inner 
product 

[[u,v]] := / (gu,gv) dfi(g). 
Jg 

This is an invariant inner product on G, because 

[[hu,hv]}= / (hgu, hgv) dfi(g) = / (gu,gv) dfi(g) = [[u,v]], 
Jg Jg 

where the second equality follows again from the left invariance of //. We have [[u, u]] non- 
negative and equal to zero if and only if u = 0, because the same thing is true for (u, u). 



We have seen that convex bodies give rise to compact affine groups through their auto- 
morphism groups. Conversely, it is a well known fact in Lie group theory that a compact 
Lie group can be imbedded as a closed subgroup of the linear group GL(V) for some finite 
dimensional vector space V. This can be found in most books on Lie groups, see for example 
[E]) [2H], [iQj- If G is a compact, affine Lie group in R™, then it is isomorphic to the compact 
linear group 

'A a} 



T = ( J :T eG, T{x) = a + Ax 



r +1 . In fact, if T(x) = a + Ax, then 

f 



x\ ( a + Ax 

V V 1 

Thus, the affine transformations of R n are in one-to-one correspondence with the linear 
transformations of R n+1 that keep the hyperplane {(x, 1) : x € R™} invariant. If G is a 
compact linear group in R™, then one can consider the convex set K = conv(Gx) where x is 
an arbitrary point in E™. Clearly, K is invariant under G, that is, G C AvX(K). In this way, 
one can obtain the following purely group theoretical result employing one of the methods 
in RemarkETO see [TT], [2S], 00]: 

Theorem 4.8. Let G be a compact group of linear transformations on W 1 . The group G 
fixes a point in W 1 , that is, there exists c in R n such that gc = c for all g in G. Moreover, 
there exists a positive definite matrix X invariant under G, that is, g T Xg = X for all g in 
G. 



We remark that the Haar measure approach in Remark 14.71 already proves this result 
directly, without considering orbits and convex sets. One can also extend the theorem to 
affine compact groups, for example by considering the isomorphic linear compact group in 
R" +1 . However, the method of employing convex bodies does have its merits. For example, 
a simple proof of the algebraicity of compact linear groups can be found in [28], pp. 130-135 
and 40J, Chapter 15, using this approach. 

The existence of the invariant quadratic form for compact Lie groups is a very important 
result, since it implies that its linear representations are completely reducible, that is, its 
representations can be written as direct products of irreducible representations. For the 
group Aut(_ftT), this simply means that the matrices appearing in the linear parts of the 
affine transformations in Aut(fC) must all have the same block diagonal structure. 
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We also mention that such concepts as the fixed points of Aut(iT) as well as its invariant 
quadratic forms belong to the invariant theory. For example, if G is a linear group in 
GL(R"), the invariant polynomials of G is the set of polynomials 

M[sci,. ..,x n ] G := {pe R[xx,...,x n ] : p{gx) =p(x), Vg G G, V x £ R"}. 

The determination of the invariant polynomials for specific groups is one of the major 
goals of invariant theory whose origins go back to the works of Cayley, Sylvester, Gordan, 
Hubert, etc. in the 19th century. A major result going back to Hilbert in 1890s implies 
that R[xi, . . . , x„] Aut ^ is finitely generated. This means that there exists finitely many 
invariant polynomials 5* (which can be assumed homogeneous) such that every invariant 
polynomial in R[xi, . . . , j; n ] Aut ^' can be written as a polynomial of elements of S. See 
[40] . pp. 280-281 for a fairly simple, direct proof. A direct significance of this result for the 
ellipsoid CE(iT) = E(X,c), say, is that both c and X are invariant. Consequently, if the 
number of generators is small, then the complexity of finding the ellipsoid CE(K) simplifies. 
This will be demonstrated in the remaining Sections of our paper. 

The automorphism group proves to be useful in other ways as well. For example, the 
following result of Davies [T^] is quite interesting. Davies calls a convex body K in R™ 
symmetric if Aut(A') acts transitively on the extreme points of K, that is, given any two 
points x, y in ext(K), there exists a transformation g in Aut(-fT) such that gx = y. He then 
proves the following result 

Lemma 4.9. If K = conv(Gx) is a symmetric convex body in R n in the sense Davies, then 
there exists a unique fixed point of Aut(K) in K. 

Proof. Let fi be the Haar probability measure /i on the compact Lie group G :— Aut(K). 
Let x be an arbitrary point in ext(AT). Since K is symmetric, ext(K) = Gx. Define the 
point 

c:= / gxdfi(g). 
Jg 

The point c is invariant under the action of G, since if ft. 6 G, we have he — J G hgxdfi(g) = 
J G gxd[i(g) = c, where the second equality is a consequence of the invariance of \i. Now, if 
a is any invariant point in K, then ga = a for all g in G, and we have a = J G ad^i(g) = 
J G gad/j,(g). By Minkowski theorem, we have a 6 conv(ext(K)) — conv(Ga;), that is, a — 

Si=i ^i9i x f° r some {\i,gi}i where gi in G and Ai > 0, Y^i=i = 1- Therefore, 
a= gadfi(g) = A 4 / gg l xdfi(g) = A») / gxdfi(g) = c, 

JG i=1 JG i=1 JG 

where third equality follows again from the invariance of the measure \x. □ 

The lemma implies in particular that the center of gravity of a symmetric body K is the 
only invariant point of K under the action of Aut(-fT). Thus, the centers of the ellipsoids 
CE(K) and IE(AT) must coincide and be equal to the center of gravity of K. 

It is also possible to determine a formula for the matrix X in the circumscribing ellipsoid 
CE(if) =E(X,c). 

Lemma 4.10. Let K C 1" be a convex body, symmetric in the sense of Davies. If X is 
an invariant matrix of K such that X^ 1 £ conv ({(y — c)(y — c) T : y G ext(-K")}J where c is 
the invariant point of K , then 
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X- x = / g-'ix-cXx-cfig^-U^g), 
Jg 

where x is an arbitrary point in ext(-ftf) and fj, is the Haar probability measure on K . 

Proof. Pick an arbitrary point x in ext(-fT) such that ext(K) — Gx. We may assume without 
any loss of generality that cg{K) = 0. Since X is an invariant matrix, we have g T Xg = X or 
X- 1 = g- x X- x g- T where we defined g~ T := {g- l ) T = (g T )-\ Let X' 1 = J^Li *iViVi = 
2i=i KhiXX T hf where hi £ G. We have 

k 

X- x = ( g- 1 X- 1 g- T dn{g)=Y j \ i [ (9~ 1 h i )xx T '(g^hif dfx(g) 
Jg i=1 Jg 

k 

= y2 Xi / 9~ 1 xx T g~ T dfi(g) = / g' 1 xx 7 ' g~ T d^(g) , 
i=1 Jg Jg 

where the last equality follows from the right invariance of the Haar measure. □ 

Corollary 4.11. Let K be a convex body in W 1 symmetric in the sense of Davies. The 
extremal covering ellipsoid of CFi(K) — E(X,c) has center c = cg(K) and 

X- l = n f g- 1 (x-c)(x-c) T g- T dii{g), 
Jg 

where x £ ext(K) is an arbitrary point and fi is the Haar probability measure on K . 



This is an immediate consequence of the above lemma, Theorem 12.21 and J2 

Many interesting questions and research directions remain regarding the automorphism 
groups of convex bodies. It is not practical to investigate these in this paper; doing so would 
increase the size of the paper beyond reasonable bounds and also change its character. We 
plan to pursue these issues in future papers. 



5 Invariance properties of a slab 

In this paper, one of the problems we are interested in is the determination of the extremal 
ellipsoids of the convex body K which is the part of a given ellipsoid E(Xq, cq) between two 
parallel hyperplanes, 

K = {x: (X (x - c ), x - c ) < 1, a < (p, x - c ) < 6}, 

where p is a non-zero vector in R" , and where a and b are such that K is nonempty. Recall 
that we call such a convex body K a slab. 

In this section, we determine Aut(K) and the form of the ellipsoids CE(K) and IE(K). 

If we substitute u = Xq^ 2 (x — Co), that is, x = cq + X 1 ^ 2 u, the quadratic in- 
equality (Xo(x — co), x — co) < 1 becomes < 1 and making the further substitution 
q := X 1 ^ 2 p, the linear form (p,x — cq) becomes 

(p,x-c Q ) = (Xq 1/2 p,xI /2 (x - co)) = (q,u). 
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Defining p = q/\\q\\, a — a/\\q\\ and (3 = b/\\q\\, the linear inequalities a < (p,x— cq) < b 
reduce to a < (p, u) < (3. Altogether, these substitutions give 

K = { C0 + A7 1/2 U :|M|<1, a<(p,u}</3}. 

Let Q be an orthogonal matrix such that Qe± = p, where e x = (1,0, . . . , 0) T in R™. Defining 
v = Q~ 1 u 1 we finally have 

K = {c + AV 1/2 Q«:|M|<l, a< (e lt v) < /?}, 

that is, K = c + X~ 1/2 Q(K), where K = {v : \\v\\ < 1, a < (e u v) < /?}. 

Since an affine transformation leaves ratios of volumes unchanged, we assume from here 
on, without loss of any generality, that our initial convex body K, which we denote by B a p, 
has the form 

B a0 = {xeWL n :\\x\\ <1, a<x x < /?}, (5.1) 

where — 1 < a < (5 < \. 

Remark 5.1. To simplify our proofs, we assume in this paper that /3 2 > a 2 . We can always 
achieve this by working with the convex body —B a p instead of B a p, if necessary. 



We use the symmetry properties of B a p to determine the possible forms of its extremal 
ellipsoids. This idea seems to be first suggested in [33] for determining CFi(B a p). 

Lemma 5.2. If a = —1 and /3 = 1, then the automorphism group of the slab B a p = B n 
consists of the n x n orthogonal matrices. In the remaining cases, the automorphism group 
Aut(B a p) consists of linear transformations T{u) = Au where A is a matrix of the form 



A 



an 
A 



an £ R, A E O n _i, 



with an = 1 if ol ^ —j3 and an = ±1 if a 



-(3. 



Proof. It is proved in Lemma 14.31 that Aut(_B„) = O n , so we consider the remaining cases. 

„T1 



Let T(x) = a + Au be an automorphism of B a p. We write A = 



and a — (ai , a) 



an c_ 
b A 

where an and ai are scalars and the rest of the variables have the appropriate dimensions. 
Since T is an invertible affine map, T(ext(B a/ 3)) = ext(B a p), where ext(B Q , i a) is the set of 
extreme points of B a /3 given by 

cxt(B al3 ) = [u = (ui, (1 - ul) 1/2 u) elx R n_1 : a < m < /3, \\u\ \ = 1, = l} . 

If u = (ui, (1 — u 2 ) 1 / 2 ^) is in ext(B a p) with ||m|| = 1, then 



w := a + Au 
We have ||u;|| = 1, that is, 



anui + (1 - u\) 1/2 {c, u) + a a 
uib + (1 - ulY^Au + a 
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1 = (a n u! + ai) 2 + (1 - u 2 )(c,m) 2 + 2(1 - u 2 ) 1/2 (c, u)(onUi + ffli) 
+ ||ui6 + a|| 2 + (1 - u 2 )||Au|| 2 + 2(1 - u 2 1 ) 1 ^(Au,u 1 b + a) 

- {(ouui + ai) 2 + + a|| 2 } + (1 - u 2 ) ((A T A + cc t )m, u) (5.2) 
+ 2(1 - tt 2 ) 1/2 ((a n ui + ai)c + A T (ui& + a), u) 

=:«(«), V||u|| = l. 

Fix Mi € (ct,0), so that 1 — u 2 ^ 0. The argument used in the proof of Lemma T4.3I shows 
that 

(anui +ai)c + A T (u 1 b + d) = 0, V«i £ (a,f3), (5.3) 

and that (1 - ui) 2 ((A T A + cc t )m, u) = {l - (ontti +a x ) 2 - ||ui& + a|| 2 } ||w|| 2 for all it G 
R™- 1 , that is, (1 -ui) 2 (A T A + cc T ) = {l - (ouiii + ai) 2 - \\uib + a|| 2 } 7„_i, for all m G 
(a,/3). Therefore, there exists a constant fc such that 

fc/„-i = A T A + cc T , 

= (on«i +ai) 2 + ||ui& + a|| 2 + fc(l - u 2 ) - 1, Viti G (a,/3). 

The equation (|5.3p implies the first two equations in (|5.4p , while the equation above gives 
rest of the equations below, 

= A T b + a 11 c, = A T a + a 1 c, fc/ n _i = A T A + cc T , (5.4) 

= a 2 1 + ||6|| 2 -fc, = oio u + (6,o), = a 2 + ||a|| 2 + fc-l. (5.5) 



We have, therefore, 



A T A ■ 



2 ...Jj ^ 





= a 2 




'fc 




fcj„_i_ 


, we 


have fc = 



kin 



at A is 

an orthogonal matrix. Furthermore, the last equation in (|5.5[) gives a = (ai,a) = 0. 
Let a; be in B a p. As ||Ax|| = ||ac|| < 1 and (ei,x) = (Ae±,Ax), we have 

B a0 = AB a0 = {Ax : \\x\\ < 1, a < (e l7 x) < [3} = {x : \\x\\ < 1, a < {Ae u x) < 13}. 

If a ^ —(3, then we must have Ae± = e±, and if a = —(3, then B a p = —B a p and we have 

311 
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^4ei = ±ei. Since Aei = ( I , we see that |an| = 1 and b = c = 0. It is then clear that 



A belongs to O n -i- 

Conversely, it is easy to verify that any matrix A in the form above is in Aut(fT). □ 

Lemma 5.3. The extremal ellipsoids CE(i? Q ^) and IE(B Q| a) have the form E(X,c) where 
c = r&\ and X — diag(a, b, b) for some a > 0, b > and r in BL Moreover, if a — —f3, 
then c = 0. 



Proof. Since the proofs are the same, we only consider the ellipsoid CF>(B a p). Let U 



1 

£/ 



be in Aut(TT). Write c — (ci, c). It follows from Theorem 14. 51 that J7c = c. This implies that 
Uc — c for all U in O n _i. Choosing U = — J n -i, we obtain c = 0. If a = — /3, then choosing 
£7 = —I n in O n gives c = 0. 
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Let CE(B al3 ) = T(B n ) = E(X,c) where T(x) = c + X^^x. Lemma [O] implies that 
U T XU = X, and writing X = 



T' 

xn v 1 



^ , this equation gives U T v — v and U T XU — X for 

all U in n -i. The first equation implies v = 0. In the second equation, we can choose U so 
that the left hand side is a diagonal matrix, proving that X itself must be a diagonal matrix. 
If U is the permutation matrix switching columns i and j, then the equation U T XU = X 
gives Xu = Xjj. This proves that X = diag(a, b,b,...,b) for some a > 0, b > 0. □ 



6 Determination of the minimum volume circumscribed 
ellipsoid of a slab 

In this section, we give explicit formulae for the minimum volume circumscribed ellipsoid 
of the slab B a p in (|5.ip using the Fritz John optimality conditions (|2.6|) and Lemma 15.31 
In this section, K will always denote the convex body B a p. 

The following theorem is one of our main results in this paper. 

Theorem 6.1. The minimum volume circumscribed ellipsoid CE(B Q ^) has the form 
E(X,c) where c = re\ and X = diag(a, b, . . . , b), where the parameters a > 0, b > 0, 
and a < r < f3 are given as follows: 

(i) If a (3 < —1/n, then 

t = 0, and a = b = 1. (6-1) 

(ii) If a + (3 = and a/3 > then 

r =°' a =42< b = ( 6 - 2 ) 

(Hi) If a + [3 ^0 and a/3 > —1/n, then 

_ n{(3 + a) 2 + 2(1 + af3) - yfK 



2(n+l)((3 + a) 
1 b _ 1 -a(r-a) 2 



(6.3) 



n(r — a)(/3 — r) ' 1 — a 2 ' 

w^ere A = n 2 (/3 2 - a 2 ) 2 + 4(1 - a 2 )(l - f3 2 ). 

We remark that Corollary 12.81 also implies the converse of Part (i) in the theorem. 



Proof. Lemma 15.31 implies that X = diag(a,b, ...,&) and c = re.\. Writing Ui = {Vi,Zi) € 
l,...,k, where 1 = ||uj|| 2 = yf + \ \zi\\ 2 , that is ||^|| 2 = 1 - y 2 , and noting 

~(y t -T) 2 (y t -T)zf\ 

(yi - r)z t ZizJ 
optimality conditions (|2.6|) may be written in the form 



that (iii — c)(u,i — c) 



, the Fritz John (necessary and sufficient) 
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k k k 

t= -}^XiVi, = ^A l z l , Xj = n, (6.4) 

i—1 i—1 i—1 

- = X^A t (y l -t) 2 , = ^Ai(j/i-r)^, ~/ n _i = ^ AiZ^f, (6.5) 

2—1 i—1 i—1 

0= a( yi -r) 2 +b(l-y 2 )-l, i=l,...,k, (6.6) 
0> a(y-r) 2 +6(l-j/ 2 )-l, Vye[a,0\. (6.7) 

The last line expresses the feasibility condition if C E(X, c): any point x = (y, z) satisfying 
a _• 2/ _• and ||x|| = 1 lies in J^, hence in E(X,c), so that it satisfies the conditions 
y 2 + \\z\\ 2 = 1 and o(l/ - r) 2 + &||z|| 2 < f. 

The conditions ([6T^ ]l — (|6T7 Jl thus characterize the ellipsoid CE(K) for K = B a0 in |[5T]) , 
Since the ellipsoid CE(if ) is unique, its paremeters (r, a, 6) are unique and can be recovered 
from the above conditions. These are done in the technical lemmas below. □ 

Lemma 6.2. If a~b in the ellipsoid GEi(B a p), then r = 0, a = b = 1, and a/3 < —1/n. 



Proof. Since a — b, (|6.6[) gives the equation 2aryi — ar 2 + a — 1 . We have r = 0, since 
otherwise all yi are the same, and the first and third equations in (|6.4[) imply that g/j = t, 
contradicting the first equation in (|6.5p . The equation 2aryi = ar 2 + a — 1 reduces to 
a = 1 = 6. Finally, since a < t/^ < (3, we obtain 

i—1 i—1 i—1 i—1 

where the last equation follows since • 1 A^y? = 1 from the first equation in (|6.5p and 

Eti a<w = o, Eti A > = « from El ■ □ 

Lemma 6.3. If a ^b in the ellipsoid CE(_B a| g), i/ien a > b and the leading coordinate yi 
of a contact point must be a or (3, and both values are taken. 



Proof. Observe that the function g(y) := a(y — r) 2 + 6(1 — y 2 ) — 1 in (|6.7[) is nonpositive 
on the interval / = [a, f3\ and equals zero at each yi. We claim that yi can not take a single 
value: otherwise the first and third equations in (16. 4|) imply that yi — r, contradicting the 
first equation in (|6.5| . (This result also follows from Corollary 12.51 ) Since g is a quadratic 
function, yi must take exactly two values, and (|6.7|) implies that these two values must 
coincide with the endpoints of the interval /. Furthermore, g{y) < only on /, has a global 
minimizer there, and so it must be a strictly convex function. This proves that a > b. □ 

Lemma 6.4. If a ^ b in the ellipsoid GE(B a p), then (r,a,b) are given by the equations 
((672]) and l[673]) . Moreover, a/3 > -1/n. 



Proof. Lemma ROl and equation (|6.6|) give a(/3— t) 2 +6(1 — /3 2 ) = 1 and a(a— t) 2 +6(1 — a 2 ) 
I Subtracting the second equation from the first and dividing by (3 — a ^ yields the 
equation r = (1 — b/a)(a + /3)/2. We also have 
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= J2 X >(y* - - P) = E - t) - (« - T")] • [(W -r)-(J3- r)] 



i=l i=l 
k k 



= J2^ iVi - r) 2 - (a + /3 - 2r) ]T A, (y, - r) + (a - r)(0 - r) £ A, 

z— 1 z— 1 i— 1 

= -+n(a-r)( ( 9-r), 
a 

where the first equation follows from Lemma 16.31 the last equation from (|6.4p and (|6.5|) . 
Altogether, we have the equations 

l = a(a-T) 2 +b(l-a 2 ), r = ( 1 - - J • - = n(r - a)(/3 - r), (6.8) 



ay 2 a 

which we use to compute the variables a, 6, and r. 

If a = — 0, then the second equation above gives t = 0. Then the third and first equations 
in (|6.8p give a = l/(na 2 ) and b — (n— l)/(n(l — a 2 )), respectively. Lastly, Lemma [5731 gives 
a > b, and this implies 1 + na0 > 0. 

If 7^ —a, then the first and third equations in (|6.8p give (a — r) 2 + (6/a)(l — a 2 ) = 
n(r — a— )(/3 — r) and the second equation gives b/a = 1 — 2t/(q + Substituting this 
value of b/a in the preceding one leads to the quadratic equality for r, 

(n + 1)(q + [3)t 2 - (n(a + (if + 2(1 + a/3)) t + (a + 0){l + na(3) = 0. (6.9) 

A straightforward but tedious calculation shows that the discriminant is A = n 2 (f3 2 — 
a 2 ) 2 + 4(1 — (3 2 )(1 — a 2 ) > 0. We claim that the feasible root is the one with negative 
discriminant. If r is the root with positive discriminant, then t— = [n(a 2 — /3 2 )+2(l— f3 2 ) + 
v/A]/(2(n+l)(a+j3)) > 0. Recalling that (3 2 > a 2 , we have [n(a 2 - (3 2 ) + 2(l- (3 2 )] 2 - A = 
4(n + 1)(1 — (3 2 ){a 2 — (3 2 ) < 0. This gives r > 0, proving the claim, as we must have 
a < t < 0. Therefore, 

n(a + 0) 2 + 2(1 + a/3) 

r = 



2(n+l)(a + 0) 
1 b _ l-a(a-T) 2 



n(r — a)(0 — t) ' 1 — a 2 

where the equations for a and £> follow from the first and second equations in 

Finally, r = (1 — b/a)(a + 0)/2 from (|6.8p and Lemma IQ1 gives a > b, implying r > 0. 
From the formula above for r, wc get 

< (n(a + /3) 2 + 2(1 + na0)) 2 - A = 4(n + l)(a + /5) 2 (1 + najS). 

This gives 1 + na/3 > 0. The lemma is proved. □ 

Remark 6.5. Some of the results contained in Theorem 16.11 may seem very counter 
intuitive. For example, consider the slab B a p when —a — — \j\pri. Although the width 
of this slab is 2/y/n, very small for large n, the optimal covering ellipsoid is the unit ball. 
This seemingly improbable behavior may be explained by the concentration of measure 
phenomenon: most of the volume of a high dimensional ball is concentrated in a thin strip 
around the "equator" , see for example [2] ■ There is a sizable literature on concentration of 
measure; the interested reader may consult the reference [25] for more information on this 
important topic. 
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6.1 Determination of the covering ellipsoid by nonlinear program- 
ming 

In this section, we give a proof of Theorem 16.11 which is completely independent of the 
previous one based on semi-infinite programming. The proof uses the uniqueness of the 
covering ellipsoid, Lemma [5. 31 on the form of the optimal ellipsoid, and Corollary 12.51 We 
thus need proofs of the first and the last results that do not depend on the results of <j2] 
and Theorem 16.11 We note that an elementary and direct proof of the uniqueness of the 
optimal covering ellipsoid can be found, for example, in Danzer et al. |llj . and we supply 
an independent, direct proof of Corollary 12. 51 This last result is not strictly necessary, but 
it simplifies our proofs, and it may be of independent interest. 

Recall that Corollary [275] states that the contact points of an extremal covering ellipsoid 
cannot lie any half space whose bounding hyperplane passes through the center of the 
ellipsoid. The following is an independent proof of this fact, in the spirit of the proof in 
Grunbaum [15 for the maximum volume inscribed ellipsoid. 



Proof. We assume without loss of generality that the ellipsoid is the unit ball B n . Clearly, 
it suffices to show that the open halfspace B + := {x : x n > 0} contains a point of E n dK . 
We prove this by contradiction. 

Consider the ellipsoids E(X) — {x : f(x) — aYTi=\ x \ + b{x n + A) 2 < 1}, having on 
their boundary the points {e^}" -1 and — e n = (0,0,... ,0,-1). We have b = 1/(1 — A) 2 , 
a = 1 - A 2 /(l - A) 2 = (1 - 2A)/(1 - A) 2 , and 

vol(£(A)) = (a"-^)- 1 = J^P - 

We claim that K C E(X) for small enough A > 0. On the one hand, if ||x|| = 1 and x n < 0, 
we have f(x) — 1 = rprxp ( x n + x n) < 0- O n the other hand, since B + contains no contact 
points, there exists e > such | \x\\ < 1 — e for all x E K n B + . It follows by continuity that 
K n B + C E(\) for small enough A > 0. These prove the claim. 

Lastly, vol(J5(A)) < 1, since (1 - A) 2 " - (1 - 2A)"- 1 = [1 - 2nA + o(A)] - [1 - (n - 
1)(-2A) + o(A)] =-2A + o(A) < for small A > 0. □ 

Theorem 6.6. The minimum volume covering ellipsoid problem for the slab B a p can be 
formulated as the nonlinear programming problem 

min — In a — (n — l)ln b, 

,a + [3 

B-t. ar-(—)(a-b)=0, ^ 

ar 2 + b - 1 - a(3(a - b) = 0, 
- a + b < 0, 

whose solution is the same as the one given in Theorem \6.1[ 



Proof. It follows from Lemma 15.31 that the optimal ellipsoid E(X, c) has the form X = 
diag(a, b, . . . ,b) and c = re\. Thus, the feasibility condition B a p C E(X, c) translates into 
the condition that the quadratic function 

g(u) = a(u - t) 2 + 6(1 - u 2 ) - 1 
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is non-positive on the interval I — [a,0\. Furthermore, Corollary 12.51 implies that there 
exist at least two contact points, which translates into the condition that the quadratic 
function g(u) takes the value zero at two distinct points in the interval /. A moment's 
reflection shows that g(u) must take the value zero at the endpoints a and /3. Consequently, 
the function g is a non-negative multiple of the function [u — a)(u — (3), that is 

g(u) + fi{u — ot)(j3 — u) — 0, for some // > 0, 

giving a — b — fi > 0, (a + (3)fi — 2ar = 0, and ar 2 + b — 1 — a/3/j, = 0. If we eliminate 
H from these constraints, we arrive at the optimization problem (|6.10[) . We have for it the 
Fritz John optimality conditions (for ordinary nonlinear programming) 

A H T ^— ) + A 2 (r -a/3)-\ a = —, 

2 a 

A (n- 1) (6.11) 
Ai( — - — ) + A 2 (l + a[3) + A 3 = , v ' 

Ai + 2A 2 t = 0, 

for some (A , Ai, A 2 , A 3 ) ^ 0, A > 0, A 3 > 0, and satisfying A 3 (a — b) = 0. 

Adding the first two equations above and substituting \\ — — 2A 2 r from the third 
equation gives 

A 2 (l-T 2 ) = ^ + Mi_i). 

a b 

If Ao = 0, we would have A 2 (l — t 2 ) = 0, and since r ^ ±1, A 2 = 0, and eventually 
Ai = = A 3 , a contradiction. Thus, we may assume that Ao = 1. 



2 . 1 n-1 



Then the above equation gives 

A 2 (l-r 2 ) = - + ^-^. (6.12) 
a b 

We solve for the decision variables (a, 6, r) discussing separately the cases a = b and 
a ^ b in the above optimality conditions. If a = b, then the first constraint in (|6.10[) gives 
r = and the second constraint gives b = 1. It remains to prove that a(3 < — 1/n. The 
equation (|6.12p gives A 2 = n, and Ai = — 2A 2 r = 0. Substituting these in the first equation 
in (|6.1ip yields < A 3 = —naf3 — 1, that is, a/3 < —1/n. 

We now consider the case a ^ b but a + (3 — 0. The first constraint in problem (|6 . 10[) 
gives r = and the third one gives A 3 = 0. Consequently, the first two conditions in (|6.1ip 
can be written as A 2 /3 2 a = 1 and A 2 (l — P 2 )b = n— 1, respectively, and the second constraint 
in (|6.10p gives /3 2 a+(l — (3 2 )b = 1. These imply A 2 = n, and a = 1/n/? 2 , b = ra ("Z^2) ■ Lastly, 
the condition a> b gives a(3 > —1/n. 

Finally, we treat the case a > b and a + j3 ^ 0. Again we have A 3 = and 

-t 2 + {a + (3)t - a/3 _ b (3 + a - 2t 



(n-1)- 



■((3 + a)r + (1 + a/3) a (3 + a 



Here the first equality is obtained by dividing the first equation in (|6.11[) by the second one 
and substituting Ai = — 2A 2 r, and the second equality follows from the first constraint in 
problem (|6.10[) . Consequently, r satisfies the quadratic equality 

(n + l)(a + (3)t 2 - [n(a + f3) 2 + 2(1 + a(3)}r + (a + /?)(! + na(3) = 0, (6.13) 
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which is the same equation as (|6.9|) in the proof of Lcmma l6.4l Following similar arguments, 
we find that 

= n(a + /3) 2 + 2(l + a/3)-VA 
T ~ 2(n+l)(a + /3) 

Solving the first and the second constraints in (|6.10[) for a, say by Cramer's rule, we find 

_ a + (3 

°~ (a + /3)(r 2 + l)-2T(l + a/3)" 

It follows from (|6 . 13[) that the denominator on the right hand side of the expression above 
equals 



n(a + f3) 2 r - n(a + (3)t 2 - n(a + (3)a(3 = n(a + /3)(t - a){/3 - r). 



This gives 



1 a + (3 - 2t 

b = - — a. 



i(t — a) (/3 — r) ' a + (5 



Finally, the inequality a(3 > — 1/n follows from the same argument at the end of the proof 
of Lemma 16.41 □ 



7 Determination of the maximum volume inscribed el- 
lipsoid of a slab 

In this section, we give explicit formulae for the maximum volume inscribed ellipsoid of 
the slab B a p in (|5.1[) using a semi-infinite programming approach. Without any loss of 
generality, we again assume throughout this section that 1 > a 2 . 

It is convenient to set up this problem as the semi-infinite program 

min{-lndet(A) : Ay + c £ B af} , Vy : = 1}, 

in which we represent the inscribed ellipsoid as E = c + A(B n ) where A is a symmetric, 
positive definite matrix with vol(-E) = w„det A. Lemma 15.31 implies that the optimal ellip- 
soid has the form A = diag(a, 6, . . . , b) and c = re± for some a > 0, b > 0, and r in R. 
Writing y = (u, z) in R x R™ -1 , we can replace the above semi-infinite program with a far 
simpler one 

min — In a — (n — 1) In 6, 
s. t. — au — t < —a, 

au + r < (3, 

{au + r) 2 +b 2 (l -u 2 ) < 1, 

in which the decision variables are (a, 6, r) and the index set is [—1,1]. 

We make some useful observations before writing down the optimality conditions for 
Problem [77T] Theorem 12.11 implies that the optimality conditions will involve at most three 
active constraints with corresponding multipliers positive. Clearly, the first constraint above 
can be active only for u = — 1 and the second one for u = 1. 



VuG [-1,1] 



(7.1) 
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Next, if a > — 1, we claim that the third constraint is active for at most one index value 
u. Otherwise, the quadratic function 



g(u) := (au + rf + b 2 (l - u 2 ) - 1 

is non-positive on the interval [—1,1] and equals zero for two distint values of u. If the 
function g{u) is actually a linear function (a=b), then it is identically zero; otherwise, it 
is easy to see that g{u) must equal zero at the endpoints —1 and 1. In all cases, we have 
g(— 1) = g{X) = 0, so that (—a + t) 2 = 1 = (a + t) 2 . This gives a — r = 1 = a + r, since the 
other possibilities give a = or a = — 1. But then a = 1, r = 0, and 1 = a — r < —a, where 
the inequality expresses the feasibility of the first inequality in Problem 17.11 We obtain 
a = — 1 and (3 = 1, a contradiction. The claim is proved. 

The following theorem is another major result of this paper. 

Theorem 7.1. The maximal inscribed ellipsoid IEi(B a p) has the form E = c + A(B n ) 

where c = re\, a < r < 0, A — diag(a, 6, . . . , b) with a > 0, b > 0, satisfying the following 

conditions: 

(i) If a = —ft, then 

r = 0, a = P, 6=1. (7.2) 
(it) IfAn{l-a 2 ) < (n + l) 2 (/3 2 - a 2 ), then 

_ a + ^Ja 2 + 4n(l - a 2 )/(n + l) 2 
T= 2^ ' (7.3) 

a = t — a, b 2 — a(a + nr), 

(m) I/4n(l - a 2 ) >(n + l) 2 (/3 2 - a 2 ) and a ^ -f3, then 

fi + a (3 -a 

/3 2 -a 2 V 




(7.4) 



Proof. If a = —1, then (3 = 1 and the optimal ellipsoid is the unit ball B n , which agrees 
with (i) of the theorem. We assume in the rest of the proof that a > — 1. 

We saw above that each of the constraints in 17.11 can be active for at most one value of 
u in [—1,1], and the first and second constraints for u = — 1 and u = 1, respectively. Then 
Theorem 12.11 gives the optimality conditions 

Ai + A 2 + 5{au + t)u = — , 

a 

xt.n z\ ( n ~ r- r 1 ii ( 7 - 5 ) 

oo(l -u ) = , u G [— 1, 1], v y 

-Ai + A 2 + 5(aw + r) = 0, 

where the non-negative multipliers (Ao, Ai, A 2 , 6/2) ^ correspond to the objective func- 
tion, and the first, second, and third (active) constraints in 17.11 respectively. 

Our first claim is that Ao > 0. Otherwise, the second equation in (|7.5|) gives S = or 
u = =pl. If S = 0, then the first equation gives the contradiction Ai = A 2 = 0. If u = —1, 
then we have =pl = a — r < —a < 1, implying a — r = — 1 or t = a + 1 > 1, another 
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contradiction. If u = 1, we have =pl = a + t. If a + r = 1, then the first equation in (|7.5|) 
gives Ai + A2 + 8 = 0, that is, Ai = A2 = 8 = 0, a contradiction. If a + t = — 1, then 
r = —a — 1 < — 1, yet another contradiction. The claim is proved. We set Ao = 1. 

The second equation in (|7.5[) gives 8 > 0, and that g(u) = for some u in (—1, 1) and 
negative elsewhere on [—1,1]. Note this means that u is the global maximum as well as the 
unique root of g on R, leading to the conditions 

b>a, u =-??—, b 2 r 2 = (l-b 2 )(b 2 -a 2 ), (7.6) 
£r — a z 

where the last equation expresses the fact that the discriminant of g equals zero. We will 
also have occasion to use the equality 

aT b 2 r 
au + T — a—: 7 + r = — = tt. (7.7) 

b z — a 2 - b z — a z 

Our second claim is that 

-a + r = a. (7.8) 

If not, then —a + r > a, Ai — 0, and the third equation in (|7.5p gives A2 = —8(au + t). 
Substituting this into the first equation in (|7.5[) leads to 5(au + r)(w — 1) = a -1 , and 
consequently toati + r < 0. But then A2 > and a + r = (3, which together with — a + r > a 
gives t > (a + 0)/2 > 0, that is, r > 0. Moreover, the second equation in (|7.6[) gives it > 0, 
and this leads to the contradiction that — A 2 = 8(au + t) > 0. The claim is proved. 



Next, we have 



1 , r, w . n — 1 6 r 

2A 2 = 5(au + t)(u + . 

a 



6 2 (l-u 2 ) 6 2 
(n — l)r (11 — l)u 



{1 - u){b 2 - a 2 ) o(l -u)' 

where the first equality is obtained by adding the first and third equations in (|7.5p . the 
second equality follows by substituting the value of 8 from the second equation in (|7.5|) 
and the value of au + t from (|7.7j) , and the last equality follows by substituting the value 
r/(b 2 — a 2 ) = u/a from the first equation in (|7.6p . Therefore, 

1 — nu 

A2 



2o(l - u)' 

Consequently, u < 1/n and u = 1/n if and only if A2 =0. Furthermore, we have u > 0: 
if u < 0, then r < by virtue of the first equation in (|7.6p . so that au + t < 0. But then 
the third equation in l|7.5p gives A 2 > Ai > 0, implying a + r = [3. This and (|7.8p give 
r = (a + /3) > 0, a contradiction. Therefore, 

0<M<-, and u=- A 2 = 0. (7.9) 

n n 

We can now prove part (i) of the theorem. We first show that 

u = 4==^. a = —(3. 



If a — — fl, then Lemma 15.31 implies that r = 0, which in turn implies u = 0. Conversely, 
if u = 0, then r = and A2 > 0, and we have —a + r = a and a + r = /3, leading to 



= r = {(3 + a)/2. Consequently, Ij7.8|) gives a — ~a — (3 and the second equation in (|7.6|) 
gives 6 = 1. 

We now consider the remaining cases < u < 1/n. We note that 

u(b 2 T 2 ) 

(au + t)ut = — = u(l — b 2 ) = u(l — a 2 ) — ar, 

fr — a A 

where the first equality follows from (|7.7[) and last two equalities from (|7.6p . leading to a 
quadratic equation for u, 

{ar)u 2 - (1 - a 2 - t 2 )u + ar = 0. (7.10) 

Define e > such that a + r = (3 — e =: j3 e . The equation cl + t = (3 e together with equation 
—a + t = a from (|7.8[) give 

r= — ^ — >0, a= — ^ — >0. (7.11) 

Substituting these in (|7.10p gives another quadratic equality for u, 

(/3 2 - a 2 )u 2 - 2(2 - 01 - a 2 )u + (/3 2 - a 2 ) = 0. (7.12) 
It is easy to verify, using (|7.9|) , that 

1 



A 2 >0 0<u<- <S=4> (n + 1) 2 (/3^ - a 2 ) <4n(l-a 2 ), 

n 

A 2 = <S=^ u = - ^ (n+ l) 2 (f£ - a 2 ) = 4n(l - a 2 ), 
n 



(7.13) 



Here the second equivalence on the first line follows because the quadratic equation for u 
in (|7.12p has negative value at u = 1/n. Since the leading term of the quadratic function is 
positive, its two roots r 1 < r 2 are positive, their product is 1, < r\ < 1/n, and r 2 > n. 

We now make our third and important claim that 

a + T<(3 iff 4n(l - a 2 ) < (n + l)(/3 2 - a 2 ). (7.14) 

On the one hand, if a + T = fl, then e = and (|7.13|) gives (n + l)(/3 2 — a 2 ) < 4n(l — a 2 ) or 
(n+l)(/3 2 — a 2 ) = 4n(l — a 2 ), depending on whether A 2 > or A 2 = 0, respectively. In either 
case, we have (n + l)(/3 2 — a 2 ) < 4n(l — a 2 ). On the other hand, if a + r < /?, then A 2 = 0, 
e > 0, and (|77T5|) gives (n+ l)(/3 2 - a 2 ) = 4n(l~a 2 ), that is, 4n(l-a 2 ) < (n+ l)(/3 2 -a 2 ). 
The claim is proved. 

The computation of the decision variables (a, 6, r) in the cases (ii) and (iii) now becomes 
a routine matter. If 4n(l — a 2 ) < (n + 1) 2 ((3 2 — a 2 ), then (|7.14p implies a + t < f3, and we 
have A2 = 0, u = 1/n. Substituting the value a = r — a from (|7.8p into (|7. 10[) gives the 
quadratic equation for r, 

(n + 1) 2 t 2 - (n + l) 2 ar - n(l - a 2 ) = 0. 
Since r > 0, the feasible root is given by 



r = (a + Va 2 + 4n(l - a 2 )/(n + l) 2 )/2. 
Then (|T.8[) gives a = r — a and (|T.6[) gives 1/n = u = (ar) /(b 2 — a 2 ), that is, b 2 — a 2 + nar. 
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If 4n(l - a 2 ) >(n + l) 2 (/3 2 - a 2 ), then (17141) and (171^1 give 

u = W~^ 2 ' 

It is easy to verify that the condition u < 1 is equivalent to ^/l — /3 2 =p yl — a 2 < 0, which 
is impossible if we choose the plus sign. Thus the feasible root is the one with the negative 
sign. Finally, the first equation in (|7.6p gives b 2 = a 2 + (ar)/u, or more explicitly the formula 
for b 2 in 1(731) . □ 

We end this section by reducing the semi-infinite program (|7.1I) to an ordinary nonlinear 
programming problem. However, we do not attempt to solve the resulting program in order 
to save space. As we already noted, the linear constraints, the first two inequalities in 
problem (|7.ip . simply reduce to the constraints a — r < —a and a + r < [3. In order to 
reduce the quadratic inequality system to a set of ordinary inequalities, we use a theorem of 
Lukacs characterizing the class of non-negative polynomials on a given interval. A simple 
inductive proof of Lukacs 's Theorem can be found in [7J. For a quadratic polynomial q(u) 
on the interval / = [a, b], this theorem states that q is non-negative on I if and only if there 
exist scalars a, 0, and 7 > such that 

q(u) = (au + (3) 2 + 7 ( u -o)(6-u). (7.15) 

Since the proof is short and simple in this case, we give it, following |7J. Note that the 
polynomial p(u) := q(u) — I (u) 2 , where I (u) — [sj q(a) (u~b) + ^/ q(b)(u — a)]/ (b—a), satisfies 
p(a) = = p(b), so that there exists a constant 7 such that p(u) — j(u — a)(b — u). Clearly, 
(|7.15| holds true if we can show that 7 > 0. Note that 1(a) < and 1(b) > so that l(u) = 
d(u — r) for some constant d and r € [a, b]. We have < q(u) — d 2 (u — r) 2 +"f(u — a)(b — u), 
or 

— d 2 (u — r) 2 < j(u — a)(b — u), Vu 6 [a, b]. 

If r is in (a, 6), then choosing u — r gives 7 > 0. If r = a or r = 6, then choosing u near r 
gives 7 > 0. This completes the proof. 

We remark that the result we just proved also follows from the one dimensional case of 
the S-procedure, see [3T] or [3D]. 

Applying this result to our quadratic function —q(u) = — (au + r) 2 — b 2 (l — u 2 ) + 1 which 
is non-negative on the interval [—1, 1], we see that there exists scalars c, d, and 7 > such 
that 

(au + t) 2 + b 2 (l -u 2 )-l = -(cu + d) 2 + 7 (> 2 - 1), 

that is, a 2 — b 2 + c 2 — 7 = 0, ar + cd = 0, and b 2 + r 2 + d 2 + 7 = 1. Therefore, the problem 
of finding IE(i? Q( g) reduces to the nonlinear programming problem 

min — In a — (n — 1) In 6, 
s. t. — a + t > a, 

a + r < P, 

a 2 - b 2 + c 2 - 7 = 0, 

ar + cd = 0, 

b 2 + T 2 +d 2 + 1 = 1, 

7>0, 

in which the decision variables are (a, 6, r, c, d, 7) and a, (3 are parameters. 
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8 Determination of the minimum volume covering el- 
lipsoid of a truncated second order cone or a cylinder 



In this section, one of the problems we are interested in is finding the minimum volume 
ellipsoid covering the truncated second order cone 

K = {x — (x!,x) e M x W 1 - 1 : \\B(x- c)|| <x u a < x x < b}, 

where B is an invertible matrix in R(™~ 1 ) >< (™ _1 ) and < a < b are constants. By an affine 
change of x, we may assume that c = and B = I n -\- We claim that by further affine 
change of variables, we can reduce the convex body K to have the form 

Q a p := conv(5 a U Sp), 

where S a := {x e B n : x± — a}, Sp := {x G B n : x\ — /?}, and —1 < a < (3 < 1. Consider 
the ball B in R n with center (a + b)e Y and radius \J a 2 + b 2 . The slice P a := {(a,x) £ 1™ : 
\\x\\ < a} C K lies in B, since ||(a,S) - (a + 6,0)|| 2 = b 2 + \ \x\\ 2 < a 2 + b 2 , and similarly 
Pb C -B. A further translation and then scaling transforms B into i?„. This proves the claim. 
Conversely, if —a ^ /3, Q Q /3 can be viewed as a truncated second order cone. 

The other problem we are interested in is finding the minimum volume ellipsoid cov- 
ering a cylinder. If —a = (3, then Q a p is clearly a cylinder. Conversely, any cylinder can 
be transformed into such a Q a f3 by an affine transformation. Consequently, the CE(Q Q /3) 
problem formulates both problems at the same time. 

Theorem 8.1. The ellipsoid CF,(Q a p) has the form E(X,c) where c = re\ and X = 
diag(a, 6, . . . , b), where the parameters a > 0, b > 0, and a < t < (3 are given as follows: 

(i) If a (3 = —1/n, then 

t = 0, and a = b = 1. (8.1) 

(ii) If a + (3 = and a(3 ^ —1/n, then 

1 , n- 1 



r = 0, 



n/3 2 ' n(l-f3 2 )' 
(Hi) If a + (3 ^ and a(3 ^ —1/n, then 

_ n(/3 + a) 2 + 2(l + a / 3)-y / A 
T ~ 2(n+l)(/3 + a) 

1 b _ 1 -a(r-a) 2 



(8.3) 



n(r — a)((3 — t] 
where A = n 2 {(3 2 - a 2 ) 2 + 4(1 - a 2 )(l - f3 2 ). 

Proof. It is clear that Aut(Q Q| g) D Aut(B Q/ g), so that Lemma [5.3l implics X = diag{a, b, 6) 
and c = rei with a > 0, b > and r G R. Let {ui}^ =1 be the contact points of the optimal 
ellipsoid with Q a @. Writing m — (yi, zi) € Ixl™" 1 , we have that yi is either a or (3, and 1 = 

INI 2 = y~i + \\ z i\\ 2 i that is > ll^ll 2 = !-y 4 2 - Since {u l -c){u l -c) T = 

Theorem 12.21 yields the following optimality conditions: 



(Vi - t) 2 {yi - r)zf 
(yi - r)zi Zizf 
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t= -y^XiVi, = ^A l z l , ^ Xj = n, (8.4) 

i— 1 i— 1 i=l 

- = X^A t (y l -t) 2 , = ^Ai(j/i-r)^, r-^n-i = ]T] A^zf, (8.5) 

2—1 2—1 2—1 

0= a(yi-r) 2 +6(l-^ 2 )-l, i=l,...,k, (8.6) 
0> a(y-r) 2 +6(l-y 2 )-l, for ye {a, /?}. (8.7) 

Here the last line expresses the feasibility condition Q a p C E(X,c), since Q Q/ 3 C E(X,c) if 
and only if <9S Q U CE(I,c), 

The conditions (|8.4p — (|8.7p characterize the ellipsoid CE(Q Q/ g). Since the ellipsoid is 
unique, its parameters (a, b, t) are unique and can be recovered from the above conditions. 

The same arguments in Lemma 16.41 applies here and gives the equation 



l = a(a-T) 2 + b(l-a 2 ), T =(l--)-^-, - = n(r - a)(/3 - r), (8.8) 

\ a J 2 a 

which we again use to compute the variables a, b, and t. If a = b in the optimal ellipsoid, 
(|8.8p immediately gives r = 0, a = b = 1, and a/3 = —1/n. 

Next, if a ^ b and a — —0, then (|8.8p gives r = 0, a = l/(na 2 ) and b — (n — 
l)/(n(l — a 2 )). Furthermore, if we have a = 1 = no 2 , then we obtain a contradiction since 
b = (n — l)/(n — 1) = 1 = a. Thererefore, a / 1 and the last equation in (|8.8j) gives 

a/3 7^ 



Lastly, a ^ b and a ^ — /3, then the same argument in Lemma 16.41 gives the quadratic 
equation (|6.9p for r, 



(n + l)(a + (3)t — (n(a + /3) 2 + 2(1 + a/3)) r + (a + /3)(1 + na/3) = 0, 

and the resulting equations in (|8.3[) for (a, b, r). Since the middle equation in (|8.8p implies 
t ^ 0, we have 

^ (n(a + (3) 2 + 2(1 + na/3)) 2 - A = 4(n + l)(a + /3) 2 (1 + na/3), 
that is, a/3 7^ —1/n. □ 
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